Paraphrase generation and information retrieval from stored text
Abstract
First the notion "paraphrase" is defined, and then several different types of paraphrase are analyzed: transformational, attenuated, lexical, derivational, and real-world. Next, several different methods of retrieving information are discussed utilizing the notions of paraphrase defined previously. It is concluded that a combination keyword-keyphrase method would constitute the optimum procedure.
Similar resources
A Deep Generative Framework for Paraphrase Generation
Paraphrase generation is an important problem in NLP, especially in question answering, information retrieval, information extraction, conversation systems, to name a few. In this paper, we address the problem of generating paraphrases automatically. Our proposed method is based on a combination of deep generative models (VAE) with sequence-to-sequence models (LSTM) to generate paraphrases, giv...
Information Retrieval based on Paraphrase
Text Retrieval systems based on ranking use similarity as an approximation to relevance. Most of these systems ignore word meaning. We assume that some measure of paraphrase would be a better similarity measure. We develop a concept of paraphrase based on Meaning-Text Theory and implement an approximation to the ideal using the Longman Dictionary of Contemporary English (LDOCE). The performance...
Citances: Citation Sentences for Semantic Analysis of Bioscience Text
We propose the use of the text of the sentences surrounding citations as an important tool for semantic interpretation of bioscience text. We hypothesize several different uses of citation sentences (which we call citances), including the creation of training and testing data for semantic analysis (especially for entity and relation recognition), synonym set creation, database curation, documen...
On the Mono- and Cross-Language Detection of Text Re-Use and Plagiarism
Automatic text re-use detection is the task of determining whether a text has been produced by considering another as its source. Plagiarism, the unacknowledged re-use of text, is probably the most famous kind of re-use. Favoured by the easy access to information through electronic media, plagiarism has risen in recent years, calling for the attention of experts in text analysis. Automatic ...
Using Multiple Metrics in Automatically Building Turkish Paraphrase Corpus
Paraphrasing is expressing similar meanings with different words in different order. In this sense it is viewed as translation in the same language. It is an important issue in natural language processing for automatic machine translation, question answering, text summarization and language generation. Studies in paraphrasing can be classified as paraphrase extraction, paraphrase generation, pa...
Journal title:
- Mech. Translat. & Comp. Linguistics
Volume 11
Publication date 1968